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This listing of claims will replace all prior versions, and listings, of claims in the application. 
Listing of Claims: 

1 . (Currently Amended) A method for providing pre-processed data for the training of 
mining models from data set training data comprising at least one set of case data, each set of 
case data of said sets at least one set of case data comprising a stored value for at least one 
variable from among a set of at least one variable, comprising: 

determining at least one mining structure variable from among said set of at least one 
variable; 

for each set of case data, retrieving a stored value for each of said at least one mining 
structure variables from said data set training data; 

performing mining model initial processing on said retrieved values; and 
storing the results of said mining model initial processing. 

2. (Currently Amended) The method of claim 1, wher e wherein said step of determining 
at least one mining structure variable from among said set of at least one variable comprises: 

accepting creation operation data comprising data comprising the identity of said 
mining structure variables. 

3. (Currently Amended) The method of claim 2, wh e r e wherein said at least one mining 
structure variable comprises a continuous variable, wh e r e wherein said creation operation 
data comprises an indication regarding discretization of said continuous variable, and wh e r e 
wherein said step of performing mining model initial processing on said retrieved values 
comprises discretizing said continuous variable according to said indication. 

4. (Currently Amended) The method of claim 3, wh e r e wherein said indication 
comprises an indication of a number of buckets into which said continuous variable should be 
discretized. 



Page 3 of 15 



DOCKET NO.: MSFT- 1735/303422.1 PATENT 

Application No.: 10/624,278 

Office Action Dated: January 24, 2006 

5. (Currently Amended) The method of claim 3 5 wh e r e wherein said indication 
comprises an indication of sub-ranges into which said continuous variable should be 
discretized. 

6. (Currently Amended). The method of claim 1 , wh e r e wherein said stored results are 
associated with at least one mining model, and wh e r e wherein each of said at least one 
mining model is trained using said stored results. 

7. (Currently Amended) A tangibly embodied computer readable medium comprising 
computer executable modules having computer executable instructions, said modules 
providing pre-processed data for the training of mining models from data set training data 
comprising at least one set of case data, each of said sets of case data comprising a stored 
value for at least one variable from among a set of at least one variable, said computer 
executable modules comprising: 

a mining structure variable determination module for determining at least one mining 
structure variable from among said set of at least one variable; 

a data set training data retrieval module for each case, retrieving a stored value for 
each of said at least one mining structure variables from said data set training data; 

an initial processing module for performing mining model initial processing on said 
retrieved values; and 

a storage module for storing the results of said mining model initial processing. 

8. (Currently Amended) The computer readable medium of claim 7, wh e r e wherein said 
mining structure variable determination module accepts creation operation data comprising 
data comprising the identity of said mining structure variables. 

9. (Currently Amended) The computer readable medium of claim 8, wh e r e wherein said 
at least one mining structure variable comprises a continuous variable, wh e r e wherein said 
creation operation data comprises an indication regarding discretization of said continuous 
variable, and wh e r e wherein said initial processing module discretizes said continuous 
variable according to said indication. 
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10. (Currently Amended) The computer readable medium of claim 9, wh e r e wherein said 
indication comprises an indication of a number of buckets into which said continuous 
variable should be discretized. 

1 1 . (Currently Amended) The computer readable medium of claim 9, wh e r e wherein said 
indication comprises an indication of sub-ranges into which said continuous variable should 
be discretized. 

12. (Currently Amended). The computer readable medium of claim 9, wh e r e wherein said 
stored results are associated with at least one mining model, and whoro wherein each of said 
at least one mining model is trained using said stored results. 

13. (Currently Amended) An application programming interface tangibly embodied in at 
least one computer readable storage medium for use in connection with providing pre- 
processed data for the training of mining models from data set training data comprising at 
least one set of case data, each of said sets of case data comprising a stored value for at least 
one variable from among a set of at least one variable, wherein said application programming 
interface receives as input creation operation data comprising data comprising the identity of 
mining structure variables from among said set of at least one variable; for each case, 
retrieves a stored value for each of said at least one mining structure variables from said data 
set training data; performs mining model initial processing on said retrieved values; and 
stores the results of said mining model initial processing. 

14. (Currently Amended) The application programming interface of claim 13, wh e r e 
wherein said at least one mining structure variable comprises a continuous variable, wher e 
wherein said creation operation data comprises an indication regarding discretization of said 
continuous variable, and wh e r e wherein said application programming interface discretizes 
said continuous variable according to said indication. 
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15. (Currently Amended) The application programming interface of claim 14, wher e 
wherein said indication comprises an indication of a number of buckets into which said 
continuous variable should be discretized. 

16. (Currently Amended) The application programming int e rfao e of interface of claim 14, 
wh e r e wherein said indication comprises an indication of sub-ranges into which said 
continuous variable should be discretized. 

17. (Currently Amended) The application programming interface of claim 13, wherein a 
said query is sent and said stored results are retrieved via at least one network. 

18. (Currently Amended) The application programming interface of claim 13, wherein 
wh e r e said stored results are associated with at least one mining model, and wherein wh e r e 
each of said at least one mining model is trained using said stored results. 

19. (Currently Amended) A system for providing pre-processed data for the training of 
mining models from data set training data comprising at least one set of case data, each of 
said sets of case data comprising a stored value for at least one variable from among a set of 
at least one variable, said system comprising: 

an application programming interface implemented at least in part by a computing 
device, said application programming interface (a) receiving as input creation operation data 
comprising data comprising the identity of mining structure variables from among said set of 
at least one variable; (b) for each case, retrieving a stored value for each of said at least one 
mining structure variables from said data set training data; (c) performs mining model initial 
processing on said retrieved values; and (d) stores the results of said mining model initial 
processing; and 

a database for storing said data set training data, operably connected with said 
application programming interface, and for returning said stored values to said application 
programming interface. 

20. (Original) A system for providing pre-processed data for the training of mining 
models from data set training data comprising at least one set of case data, each of said sets of 
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said sets of case data comprising a stored value for at least one variable from among a set of 
at least one variable, said system comprising: 

determination means for determining at least one mining structure variable from 
among said set of at least one variable; 

retrieval means for each case, retrieving a stored value for each of said at least one 
mining structure variables from said data set training data; 

initial processing means for performing mining model initial processing on said 
retrieved values; and 

storage means for storing the results of said mining model initial processing. 

21. (Currently Amended) The system of claim 20, wh e r e wherein said determination 
means comprises: 

data acceptance means for accepting creation operation data comprising data 
comprising the identity of said mining structure variables. 

22. (Currently Amended) The system of claim 21, wh e r e wherein said at least one 
mining structure variable comprises a continuous variable, where wherein said creation 
operation data comprises an indication regarding discretization of said continuous variable, 
and wh e r e wherein initial processing means comprises discretization means for discretizing 
said continuous variable according to said indication. 

23. (Currently Amended) The system of claim 22, wh e r e wherein said indication 
comprises an indication of a number of buckets into which said continuous variable should be 
discretized. 

24. (Currently Amended) The system of claim 22, wh e r e wherein said indication 
comprises an indication of sub-ranges into which said continuous variable should be 
discretized. 

25. (Currently Amended) The application programming interface of claim 22, wh e r e 
wherein said stored results are associated with at least one mining model, and wh e r e wherein 
each of said at least one mining model is trained using said stored results. 
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26. (Original) A method for the training of a mining model from data set training data 
comprising at least one set of case data, each of said sets of case data comprising a stored 
value for at least one variable from among a set of at least one variable, comprising: 

determining at least one mining structure variable from among said set of at least one 
variable; 

for each case, retrieving a stored value for each of said at least one mining structure 
variables from said data set training data; 

performing mining model initial processing on said retrieved values; 

storing the results of said mining model initial processing in a mining structure; and 

training said mining model using said stored results. 

27. (Currently Amended) The method of claim 26, further comprising: 

storing conn e ction link data indicating that said mining model has been trained on 
data from said mining structure. 

28. (Original) The method of claim 26, further comprising: 

accepting a drill through query for specified data from said mining structure and 
providing said specified data. 

29. (Currently Amended) The method of claim 26, wh e r e wherein additional mining 
models are associated with said mining structure, and wh e re wherein said method further 
comprises: 

training each of said additional mining models using said stored results. 

30. (Currently Amended) The method of claim 26, wh e r e wherein said mining structure 
is treated as a first class object in a database. 

3 1 . (Currently Amended) A tangibly embodied computer readable medium comprising 
computer executable modules having computer executable instructions, said modules training 
a mining model from data set training data comprising at least one set of case data, each of 
said sets of case data comprising a stored value for at least one variable from among a set of 
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at least one variable, said modules comprising: 

a mining structure variable determination module for determining at least one mining 
structure variable from among said set of at least one variable; 

a data set training data retrieval module for each case, retrieving a stored value for 
each of said at least one mining structure variables from said data set training data; 

an initial processing module for performing mining model initial processing on said 
retrieved values; 

a storage module for storing the results of said mining model initial processing; and 
a training module for training a mining model using said stored results. 

32. (Currently Amended) The computer readable medium of claim 3 1 , said modules 
further comprising: 

conn e ction link data storage module storing conn e ction link data indicating that said 
mining model has been trained on data from said mining structure. 

33. (Original) The computer readable medium of claim 31, said modules further 
comprising: 

drill through module for accepting a drill through query for specified data from said 
mining structure and providing said specified data. 

34. (Currently Amended) The computer readable medium of claim 3 1 , wh e r e wherein 
additional mining models are associated with said mining structure, and wh e r e wherein said 
training module further trains each of said additional mining models using said stored results. 

35. (Currently Amended) The computer readable medium of claim 3 1 , wh e r e wherein 
said mining structure is treated as a first class object in a database. 
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